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CIainp39: The isolated nucleic acid molecule of claim 110, comprising SEQ ID 

NO: 8. 

ClainU<J£>> The isolated nucleic acid molecule of claim 110, consisting of SEQ ID 

NO: 7. 
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Claim The isolated nucleic acid molecule of claim 110, consisting of SEQ ID 

Claim Vp? The isolated niftffiic acid molecule of claim 110, consisting of SEQ ID 



NO: 8. 



Claim U3r The isolated nucleic acid molecule of claim 110, consisting of SEQ ID 



REMARKS 

The preceding amendment is presented in accordance with 37 CFR § 1.121(h). 
Claims 1 10-143 will be pending. 

The amendment address points 3 and 10 of the office action. The objections to 
claims 108 and 109 are obviated via the sequence listing. Applicants note that the sequence 
listing objection was not sent with the June 5 office action, but was telefaxed on June 12, 2001. 

Prior to addressing the bases for rejecting the claims, it is worthwhile to discuss what the 
empirical evidence provided in the application shows. 

In example 1, a melanoma cell line is discussed. The prior art describes how various 
members of the CT family of antigens are expressed by this cell. The cell was used to determine 
expression of other members of the family. It should be borne in mind that the term "CT 
antigen" is discussed in the specification. 

The cells were studied, using the "SEREX" methodology. This method is known in the 
art. To perform SEREX, total RNA is removed from cells, converted to cDNA and, a cDNA 
library is constructed in a vector. In this case, "AZAP" was used. The AZAP cDNA library was 
then used to transform E. coli . 

After E. coli are transformed, the transformed cells are used to screen samples for 
antibodies. Allogeneic samples were used in example 1 . Essentially, one is determining if the 

25085722.1 c 



LUD 5538.1 CIP - JEL/NDH (09807399) 



protein encoded by the cDNA used to transform the E. coli reacts with antibodies in the sample. 
One knows that the reaction is not with a "native" E. coli protein, because in the SEREX method, 
such antibodies are "stripped" prior to reaction with the transformants. Applicants refer the 
examiner to the references cited at page 5, i.e., Sahin, et al , Proc. Natl. Acad Sci USA 92:11810- 
1 1813, and U.S. Patent No. 5,698,396, both of which are incorporated by reference. Full details 
of the SEREX methodology are described in these references. Hence, one knows that, if there 
are any positives, the reactive protein was from the cDNA library. 

The yield in this experiment was high, i.e., there were 65 positives. 

Example 2 describes, simply, attempts to identify the positive materials. Standard 
methods were used. 

Table 8, at page 8, shows that of the 65 positives 18 were known CT antigens. Hence, 
one knows from this that one can identify known members of the CT family. Ten sequences 
were not identifiable in public sequence libraries. There were also, 33 clones which belonged to 
the "KOC" family. 

Examples 3 and 5 deal with CT antigen, i.e., "CT7." Please see US Patent 
No. 6,297,364. 

Example 6 describes sequence analysis of the 33 members of the family found in the 
experiments of example 2. As this example makes manifestly clear, two different genes, referred 
to as KOC-2 and KOC-3 were identified. They were clearly not artifacts, because when a 
testicular cDNA library was analyzed, again using allogeneic samples, the molecules were 
found. 

In example 7, the DNA expression pattern of the genes was studies. KOC-2 was found in 
normal tissue, but only testis and, with modification of the assay, in normal kidney cells. This is 
a "classic" CT expression pattern. As noted in the application, "CT" means "cancer/testis." 

The KOC-3 gene was found to be expressed in both cancer and normal cells. 

Now, the examiner has rejected all of claims 53-73, 80-84, 108 and 109, arguing that the 
claims are not supported by either "a specific substantial utility or a well-established utility." 

The examiner states that "the specification fails to demonstrate a utility for the nucleic 
acids of SEQ ID NOS: 5-8, as the specification fails to correlated the presence of the protein 
encoded by SEQ ID NOS: 5-8 in clinical samples." The examiner then goes on to hypothesize 
that (i) antibodies can cross react with different proteins, and (ii) that the polynucleotides may 
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not be expressed due to any of a number of hypotheses. The examiner also states that "under 
conditions of recombinant expression, epitopes of KOC-2 and 3 which are not normally 
expressed in a patient within the context of MHC could be expressed by E. coli." 

All of this is unproven speculation. The facts are the following. Something in the serum 
of patient NW38 reacted with a protein expressed by cDNA from an allogeneic source. 
Antibodies were present in the cancer patient's serum. They did not react with an E. coli protein 
because, as has been pointed out, supra , when SEREX is carried out, any reactivity with E. coli 
proteins is eliminated by use of controls. Again, the examiner is directed to the references which 
describe the SEREX methodology. 

With respect to the elaborate argument regarding factors affecting translation of proteins, 
this is all well and good, but it is irrelevant. It is irrelevant because there were antibodies in the 
sample analyzed. Antibodies are produced in response to a protein. H*nce, at some point, 
NW38 (the source of the sample), did have the protein in his or her blood. It got there by being 
expressed. The issue for CT antigens, as was set forth very clearly in the specification, is: are 
they expressed in individuals with cancer? The evidence indicates that KOC-2 and KOC-3 are 
so expressed, regardless of what happened to the irrelevant ferritin polypeptide, ornithine 
decarboxylase, and p glycoprotein discussed by the examiner, and not shown to be cancer 
antigens. With respect to the failure of protein and mRNA levels to correlate for p53, again, this 
is irrelevant. Applicants have not claimed a quantitative assay. The correct question is: is the 
molecule expressed in cancer patients and usable for diagnosis? The correlation of p53 with 
cancer is too well known to comment on. 

With respect to the examiner's hypothetical argument that "antibodies bind to epitopes in 
proteins and antibodies can cross react with different proteins containing the same or similar 
proteins," this again ignores the evidence. In the SEREX methodology, antibodies against a host 
cell, e.g., E. coli , are removed in a stripping step. All that is left are antibodies which bind to 
non- E. coli proteins. Applicants used a cDNA library. It is well known that, when using a 
cDNA library, a single cDNA molecule is used as the transfectant. Hence, if an antibody reacts 
with a transfected or transformed cell, it must be reacting with the foreign protein, since 
antibodies to native proteins have been eliminated. Hence, all of the examiner's points are based 
upon generalizations which do not stand up to analysis when the data re considered. With 
respect to the statement bridging pages 4-5: 
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"The data indicating that the KOC-2 polynucleotide was restricted to the normal 
testis and that the KOC-3 polynucleotide was universal in normal tissues." 

This is absolutely an incorrect reading of the data. First of all, there were antibodies against 
KOC-2 and KOC-3 found in the patient (NW38) sample. Hence, there was no immunotolerance. 
Second, the data state that "the only positive normal tissue was testis." These do not state that 
this was the only positive tissue . There is a very big difference. Further, if the examiner is going 
to argue that levels of expression can vary, then she must concede that, notwithstanding 
expression of KOC-3 in normal tissue, cancer tissues express it at a level such that an immune 
response is possible - especially since applicants have shown this. 

Turning next to the rejection under 35 USC § 1 12, the statement of this rejection is not 
consistent. All claims are rejected, even though the examiner states that the claims are enabling 
for "polynucleotides comprising SEQ ID NOS: 5-8..." If this is the case, then claims 54-70 and 
108 and 109 should not be rejected, as claims 54-70 expressly recite what is said to be enabled, 

and claims 108 and 109 have nothing to do with the issues raised in the rejection. The rejection 

recites points "A" and "B," and it is noted that only claims 53, 71-73 and 80-84 are rejected. 

They are no reasons given for rejecting the remaining claims. 

Applicants contacted the examiner regarding this issue on October 22. The undersigned 

was advised that, if the utility rejection were overcome, then the issue at point 7 would be moot. 

It is noted that claims 54-70 are not rejected in the discussion following point 7. Claims 53 & 54 

were combined to form claim 1 10. As such, claim 1 10 and claims dependent thereon cannot be 

rejected under 35 USC §112. 

In view of the foregoing, allowance of this application is believed proper and is urged. 
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In view of the lack of clarity in this rejection, applicants cannot address it. The examiner 
is invited to clarify the rejection so it can be addressed. At such time, applicants will respond 
fully. 

Respectfully submitted, 
FULBRIGHT & JAWORSKI, L.L.P. 



666 Fifth Avenue 
New York, New York 10103-3198 
Telephone: 212-318-3168 
Telecopier: 212-318-3400 




Norman D. Hanson, Esq 
Registration No. 30,946 
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-<110> ^Zy^QhgQtSPTao-Tsenq 

Ali 

Tsang, Solam 
Stockert, Elisabeth 
Jager, Elke 
Knuth, Alexander 
Old, Lloyd J. 

<120> Isolated Nucleic Acid Molecules Encoding Cancer Associated Antigen, The 
Antigens Per Se, And Uses Thereof 



<130> LUD 5538.1 



<140> 09/270,437 
<141> 1999 - 03 - 16 
<150> 09/061,709 
<151> 1998-04-17 



<160> 23 



<210> 1 

<211> 4265 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 1 



GTCTGAAGGA CCTGAGGCAT TTTGTGACGA 
CTTATAGACC TATCCAGTCT TCAAGGTGCT 
GAGGGACACA TACATCCTAA AAGCACCACA 
AAGGTTCCCA GAAGACAAAC CCCCTAGGAA 
ACCTTAAGAG AAGAAGAGCT GTAAGCCGGC 
TATGCCTACT GCTGGGATGC CGAGTCTTCT 
TCCTGAGGGG GAGGACTCCC AGTCTCCTCT 
CGACACCCTG TATCCTCTCC AGAGTCCTCA 
TCCTCTCCAG AGACCTCCTG AGGGGAAGGA 
TTCTCCTGAG GGCGACGACA CCCAGTCTCC 
GAAGGACTCC CTGTCTCCTC TAGAGATTTC 
GTCTCCTCTG CAGAATCCTG CGAGTTCCTT 
GAGTTCCCCT GAGAGTATTC AAAGTCCTTT 
TCCTGTGAGC GCCGCCTCCT CCTCCACTTT 
TACTCAAAGT CCTTTTGAGG GTTTTCCCCA 
CTTCTCCTCC ACTTTATTGA GTATTTTCCA 
TGAGGGTTTT GCACAGTCTC CTCTCCAGAT 
ACTGAGTCTT TTCCAGAGTT CCCCTGAGAG 
GTCTCCACTC CAGATTCCTG TGAGCCGCTC 
GAGTTCCCCT GAGAGAACTC AGAGTACTTT 
TCCTGTGAGC CCCTCCTTCT CCTCCACTTT 
AACTCAGAGT ACTTTTGAGG GTTTTCCCCA 
CTTCTCCTCC ACTTTATTGA GTCTTTTCCA 
TGAGGGTTTT CCCCAGTCTC CTCTCCAGAT 
ACTGAGTCTT TTCCAGAGTT CCCCTGAGAG 
GTCTCCTCTC CAGATTCCTA TGACCTCCTC 
GAGTTCTCCT GAGAGTGCTC AAAGTGCTTT 
TCCTGTGAGC TCCTCTTTCT CCTACACTTT 
AACTCACAGT ACTTTTGAGG GTTTTCCCCA 



GGATCGTCTC AGGTCAGCGG AGGGAGGAGA 60 
CCAGAAAGCA GGAGTTGAAG ACCTGGGTGT 120 
GCAGAGGAGG CCCAGGCAGT GCCAGGAGTC 180 
GACAGGCGAC CTGTGAGGCC CTAGAGCACC 240 
CTTTGTCAGA GCCATCATGG GGGACAAGGA 300 
CCAGAGTTCC TCTGAGAGTC CTCAGAGTTG 360 
CCAGATTCCC CAGAGTTCTC CTGAGAGCGA 4 20 
GAGTCGTTCT GAGGGGGAGG ACTCCTCGGA 4 80 
CTCCCAGTCT CCTCTCCAGA TTCCCCAGAG 540 
TCTCCAGAAT TCTCAGAGTT CTCCTGAGGG 600 
TCAGAGCCCT CCTGAGGGTG AGGATGTCCA 660 
CTTCTCCTCT GCTTTATTGA GTATTTTCCA 720 
TGAGGGTTTT CCCCAGTCTG TTCTCCAGAT 780 
AGTGAGTATT TTCCAGAGTT CCCCTGAGAG 840 
GTCTCCACTC CAGATTCCTG TGAGCCGCTC 900 
GAGTTCCCCT GAGAGAAGTC AGAGAACTTC 960 
TCCTGTGAGC TCCTCCTCGT CCTCCACTTT 1020 
AACTCAGAGT ACTTTTGAGG GTTTTCCCCA 1080 
CTTCTCCTCC ACTTTATTGA GTATTTTCCA 114 0 
TGAGGGTTTT GCCCAGTCTC CTCTCCAGAT 1200 
AGTGAGTATT TTCCAGAGTT CCCCTGAGAG 1260 
GTCTCCTCTC CAGATTCCTG TGAGCTCCTC 1320 
GAGTTCCCCT GAGAGAACTC AGAGTACTTT 1380 
TCCTGGAAGC CCCTCCTTCT CCTCCACTTT 14 4 0 
AACTCACAGT ACTTTTGAGG GTTTTCCCCA 1500 
CTTCTCCTCT ACTTTATTGA GTATTTTACA 1560 
TGAGGGTTTT CCCCAGTCTC CTCTCCAGAT 1620 
ATTGAGTCTT TTCCAGAGTT CCCCTGAGAG 1680 
GTCTCCTCTC CAGATTCCTG TGAGCTCCTC 1740 



-CTCCTCCTCC TCCACTTTAT TGAGTCTTTT CCAGAGTTCC CCTGAGTGTA CTCAAAGTAC 1800 
TTTTGAGGGT TTTCCCCAGT CTCCTCTCCA GATTCCTCAG AGTCCTCCTG AAGGGGAGAA 1860 
TACCCATTCT CCTCTCCAGA TTGTTCCAAG TCTTCCTGAG TGGGAGGACT CCCTGTCTCC 1920 
TCACTACTTT CCTCAGAGCC CTCCTCAGGG GGAGGACTCC CTATCTCCTC ACTACTTTCC 1980 
TCAGAGCCCT CCTCAGGGGG AGGACTCCCT GTCTCCTCAC TACTTTCCTC AGAGCCCTCA 2040 
GGGGGAGGAC TCCCTGTCTC CTCACTACTT TCCTCAGAGC CCTCCTCAGG GGGAGGACTC 2100 
CATGTCTCCT CTCTACTTTC CTCAGAGTCC TCTTCAGGGG GAGGAATTCC AGTCTTCTCT 2160 
CCAGAGCCCT GTGAGCATCT GCTCCTCCTC CACTCCATCC AGTCTTCCCC AGAGTTTCCC 2220 
TGAGAGTTCT CAGAGTCCTC CTGAGGGGCC TGTCCAGTCT CCTCTCCATA GTCCTCAGAG 2280 
CCCTCCTGAG GGGATGCACT CCCAATCTCC TCTCCAGAGT CCTGAGAGTG CTCCTGAGGG 2340 
GGAGGATTCC CTGTCTCCTC TCCAAATTCC TCAGAGTCCT CTTGAGGGAG AGGACTCCCT 2400 
GTCTTCTCTC CATTTTCCTC AGAGTCCTCC TGAGT GGGAG GACTCCCTCT CTCCTCTCCA 24 60 
CTTTCCTCAG TTTCCTCCTC AGGGGGAGGA CTTCCAGTCT TCTCTCCAGA GTCCTGTGAG 2520 
TATCTGCTCC TCCTCCACTT CTTTGAGTCT TCCCCAGAGT TTCCCTGAGA GTCCTCAGAG 2580 
TCCTCCTGAG GGGCCTGCTC AGTCTCCTCT CCAGAGACCT GTCAGCTCCT TCTTCTCCTA 2640 
CACTTTAGCG AGTCTTCTCC AAAGTTCCCA TGAGAGTCCT CAGAGTCCTC CTGAGGGGCC 2700 
TGCCCAGTCT CCTCTCCAGA GTCCTGTGAG CTCCTTCCCC TCCTCCACTT CATCGAGTCT 2760 
TTCCCAGAGT TCTCCTGTGA GCTCCTTCCC CTCCTCCACT TCATCGAGTC TTTCCAAGAG 2820 
TTCCCCTGAG AGTCCTCTCC AGAGTCCTGT GATCTCCTTC TCCTCCTCCA CTTCATTGAG 2880 
CCCATTCAGT GAAGAGTCCA GCAGCCCAGT AGATGAATAT ACAAGTTCCT CAGACACCTT 2940 
GCTAGAGAGT GATTCCTTGA CAGACAGCGA GTCCTTGATA GAGAGCGAGC CCTTGTTCAC 3000 
TTATACACTG GATGAAAAGG TGGACGAGTT GGCGCGGTTT CTTCTCCTCA AATATCAAGT 3060 
GAAGCAGCCT ATCACAAAGG CAGAGATGCT GACGAATGTC ATCAGCAGGT ACACGGGCTA 3120 
CTTTCCTGTG ATCTTCAGGA AAGCCCGTGA GTTCATAGAG ATACTTTTTG GCATTTCCCT 3180 
GAGAGAAGTG GACCCTGATG ACTCCTATGT CTTTGTAAAC ACATTAGACC TCACCTCTGA 3240 
GGGGTGTCTG AGTGATGAGC AGGGCATGTC CCAGAACCGC CTCCTGATTC TTATTCTGAG 3300 
TATCATCTTC ATAAAGGGCA CCTATGCCTC TGAGGAGGTC ATCTGGGATG TGCTGAGTGG 3360 
AATAGGGGTG CGTGCTGGGA GGGAGCACTT TGCCTTTGGG GAGCCCAGGG AGCTCCTCAC 3420 
TAAAGTTTGG GTGCAGGAAC ATTACCTAGA GTACCGGGAG GTGCCCAACT CTTCTCCTCC 34 80 
TCGTTACGAA TTCCTGTGGG GTCCAAGAGC TCATTCAGAA GTCATTAAGA GGAAAGTAGT 3540 
AGAGTTTTTG GCCATGCTAA AGAATACCGT CCCTATTACC TTTCCATCCT CTTACAAGGA 3600 
TGCTTTGAAA GATGTGGAAG AGAGAGCCCA GGCCATAATT G AC AC C AC AG ATGATTCGAC 3660 
TGCCACAGAA AGTGCAAGCT CCAGTGTCAT GTCCCCCAGC TTCTCTTCTG AGTGAAGTCT 3720 
AGGGCAGATT CTTCCCTCTG AGTTTGAAGG GGGCAGTCGA GTTTCTACGT GGTGGAGGGC 3780 
CTGGTTGAGG CTGGAGAGAA CACAGTGCTA TTTGCATTTC TGTTCCATAT GGGTAGTTAT 3840 
GGGGTTTACC TGTTTTACTT TTGGGTATTT TTCAAATGCT TTTCCTATTA ATAACAGGTT 3900 
TAAATAGCTT CAGAATCCTA GTTTATGCAC ATGAGTCGCA CATGTATTGC TGTTTTTCTG 3960 
GTTTAAGAGT AACAGTTTGA TATTTTGTAA AAACAAAAAC ACACCCAAAC ACACCACATT 4020 
GGGAAAACCT TCTGCCTCAT TTTGTGATGT GTCACAGGTT AATGTGGTGT TACTGTAGGA 4 080 
ATTTTCTTGA AACTGTGAAG GAACTCTGCA GTTAAATAGT GGAATAAAGT AAAGGATTGT 414 0 
TAATGTTTGC ATTTCCTCAG GTCCTTTAGT CTGTTGTTCT TGAAAACTAA AGATACATAC 4200 
CTGGTTTGCT TGGCTTACGT AAGAAAGTCG AAGAAAGTAA ACTGTAATAA ATAAAAGTGT 4 2 60 
CAGTG 42 65 

<210> 2 
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45 






Tyr Pro 


Leu 


Gin Ser 


Pro 


Gin 


Ser 


Arg 


Ser 


Glu 


Gly Glu Asp 
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Leu 
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Pro 


Glu 


Gly 


Lys 
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Ser Gin Ser 


Pro 


Leu 



'65 70 75 80 

Gin lie Pro Gin Ser Ser Pro Glu Gly Asp Asp Thr Gin Ser Pro Leu 
85 90 95 



Gin Asn Ser Gin Ser Ser Pro Glu Gly Lys Asp Ser Leu Ser Pro Leu 

100 105 110 

Glu He Ser Gin Ser Pro Pro Glu Gly Glu Asp Val Gin Ser Pro Leu 

115 120 125 

Gin Asn Pro Ala Ser Ser Phe Phe Ser Ser Ala Leu Leu Ser He Phe 

130 135 140 

Gin Ser Ser Pro Glu Ser He Gin Ser Pro Phe Glu Gly Phe Pro Gin 
145 150 155 160 

Ser Val Leu Gin He Pro Val Ser Ala Ala Ser Ser Ser Thr Leu Val 

165 170 175 

Ser He Phe Gin Ser Ser Pro Glu Ser Thr Gin Ser Pro Phe Glu Gly 

180 185 190 

Phe Pro Gin Ser Pro Leu Gin He Pro Val Ser Arg Ser Phe Ser Ser 

195 200 205 

Thr Leu Leu Ser He Phe Gin Ser Ser Pro Glu Arg Ser Gin Arg Thr 

210 215 220 

Ser Glu Gly Phe Ala Gin Ser Pro Leu Gin He Pro Val Ser Ser Ser 
225 230 235 240 

Ser Ser Ser Thr Leu Leu Ser Leu Phe Gin Ser Ser Pro Glu Arg Thr 

245 250 255 

Gin Ser Thr Phe Glu Gly Phe Pro Gin Ser Pro Leu Gin He Pro Val 

260 265 270 

Ser Arg Ser Phe Ser Ser Thr Leu Leu Ser He Phe Gin Ser Ser Pro 

275 280 285 

Glu Arg Thr Gin Ser Thr Phe Glu Gly Phe Ala Gin Ser Pro Leu Gin 

290 295 300 

He Pro Val Ser Pro Ser Phe Ser Ser Thr Leu Val Ser He Phe Gin 
305 310 315 320 

Ser Ser Pro Glu Arg Thr Gin Ser Thr Phe Glu Gly Phe Pro Gin Ser 

325 330 335 

Pro Leu Gin He Pro Val Ser Ser Ser Phe Ser Ser Thr Leu Leu Ser 

340 345 350 

Leu Phe Gin Ser Ser Pro Glu Arg Thr Gin Ser Thr Phe Glu Gly Phe 

355 360 365 

Pro Gin Ser Pro Leu Gin He Pro Gly Ser Pro Ser Phe Ser Ser Thr 

370 375 380 

Leu Leu Ser Leu Phe Gin Ser Ser Pro Glu Arg Thr His Ser Thr Phe 
385 390 395 400 

Glu Gly Phe Pro Gin Ser Pro Leu Gin He Pro Met Thr Ser Ser Phe 

405 410 415 

Ser Ser Thr Leu Leu Ser He Leu Gin Ser Ser Pro Glu Ser Ala Gin 

420 425 430 

Ser Ala Phe Glu Gly Phe Pro Gin Ser Pro Leu Gin He Pro Val Ser 

435 440 445 

Ser Ser Phe Ser Tyr Thr Leu Leu Ser Leu Phe Gin Ser Ser Pro Glu 

450 455 460 

Arg Thr His Ser Thr Phe Glu Gly Phe Pro Gin Ser Pro Leu Gin He 
465 470 475 480 

Pro Val Ser Ser Ser Ser Ser Ser Ser Thr Leu Leu Ser Leu Phe Gin 

485 490 495 

Ser Ser Pro Glu Cys Thr Gin Ser Thr Phe Glu Gly Phe Pro Gin Ser 

500 505 510 

Pro Leu Gin He Pro Gin Ser Pro Pro Glu Gly Glu Asn Thr His Ser 

515 520 525 

Pro Leu Gin He Val Pro Ser Leu Pro Glu Trp Glu Asp Ser Leu Ser 
530 535 540 



Pro His Tyr Phe Pro Gin Ser Pro Pro Gin Gly Glu Asp Ser Leu Ser 
- 550 555 Sfin 

Pro His Tyr Phe Pro Gin Ser Pro Pro Gin" Gly Glu Asp Ser Leu Ser 

565 570 575 

Pro His Tyr Phe Pro Gin Ser Pro Gin Gly Glu Asp Ser Leu Ser Pro 

580 585 590 

His Tyr Phe Pro Gin Ser Pro Pro Gin Gly Glu Asp Ser Met Ser Pro 

i95 600 605 

Leu Tyr Phe Pro Gin Ser Pro Leu Gin Gly Glu Glu Phe Gin Ser Ser 

u 615 620 

Leu Gin Ser Pro Val Ser lie Cys Ser Ser Ser Thr Pro Ser Ser Leu 

630 635 640 

Pro Gin Ser Phe Pro Glu Ser Ser Gin Ser Pro Pro Glu Gly Pro Val 

645 650 655 

Gin Ser Pro Leu His Ser Pro Gin Ser Pro Pro Glu Gly Met His Ser 

660 665 670 

Gin Ser Pro Leu Gin Ser Pro Glu Ser Ala Pro Glu Gly Glu Asp Ser 
j 0 675 680 68 5 

Leu Ser Pro Leu Gin lie Pro Gin Ser Pro Leu Glu Gly Glu Asp Ser 

695 700 
Leu Ser Ser Leu His Phe Pro Gin Ser Pro Pro Glu Trp Glu Asp Ser 

710 "715 
Leu Ser Pro Leu His Phe Pro Gin Phe Pro Pro Gin Gly Glu Asp Phe 

725 730 735 

Gin Ser Ser Leu Gin Ser Pro Val Ser lie Cys Ser Ser Ser Thr Ser 

740 745 750 

Leu Ser Leu Pro Gin Ser Phe Pro Glu Ser Pro Gin Ser Pro Pro Glu 

755 760 765 

Gly Pro Ala Gin Ser Pro Leu Gin Arg Pro Val Ser Ser Phe Phe Ser 
770 775 780 

Tyr Thr Leu Ala Ser Leu Leu Gin Ser Ser His Glu Ser Pro Gin Ser 

790 795 80Q 

Pro Pro Glu Gly Pro Ala Gin Ser Pro Leu Gin Ser Pro Val Ser Ser 

805 810 815 

Phe Pro Ser Ser Thr Ser Ser Ser Leu Ser Gin Ser Ser Pro Val Ser 

820 825 830 

Ser Phe Pro Ser Ser Thr Ser Ser Ser Leu Ser Lys Ser Ser Pro Glu 

835 840 845 

Ser Pro Leu Gin Ser Pro Val lie Ser Phe Ser Ser Ser Thr Ser Leu 

850 855 860 

Ser Pro Phe Ser Glu Glu Ser Ser Ser Pro Val Asp Glu Tyr Thr Ser 
o „ 870 875 880 

Ser Ser Asp Thr Leu Leu Glu Ser Asp Ser Leu Thr Asp Ser Glu Ser 

885 890 895 

Leu lie Glu Ser Glu Pro Leu Phe Thr Tyr Thr Leu Asp Glu Lys Val 

900 905 910 

Asp Glu Leu Ala Arg Phe Leu Leu Leu Lys Tyr Gin Val Lys Gin Pro 

915 920 925 

lie Thr Lys Ala Glu Met Leu Thr Asn Val He Ser Arg Tyr Thr Glv 

930 935 940 

Tyr Phe Pro Val lie Phe Arg Lys Ala Arg Glu Phe lie Glu lie Leu 
9 * 5 950 955 960 

Phe Gly He Ser Leu Arg Glu Val Asp Pro Asp Asp Ser Tyr Val Phe 

9 65 970 975 

Val Asn Thr Leu Asp Leu Thr Ser Glu Gly Cys Leu Ser Asp Glu Gin 

980 985 990 

Gly Met Ser Gin Asn Arg Leu Leu He Leu He Leu Ser He He Phe 

"5 1000 10 05 

He Lys Gly Thr Tyr Ala Ser Glu Glu Val He Trp Asp Val Leu Ser 
1010 1015 1020 



•Gly He Gly Val Arg Ala Gly Arg Glu His Phe Ala Phe Gly Glu Pro 
1025 1030 1035 1040 

Arg Glu Leu Leu Thr Lys Val Trp Val Gin Glu His Tyr Leu Glu Tyr 

1045 - 1050 1055 

Arg Glu Val Pro Asn Ser Ser Pro Pro Arg Tyr Glu Phe Leu Trp Gly 

1060 1065 1070 

Pro Arg Ala His Ser Glu Val He Lys Arg Lys Val Val Glu Phe Leu 

1075 1080 1085 

Ala Met Leu Lys Asn Thr Val Pro He Thr Phe Pro Ser Ser Tyr Lys 

1090 1095 1100 

Asp Ala Leu Lys Asp Val Glu Glu Arg Ala Gin Ala He He Asp Thr 
H05 1110 1115 1120 

Thr Asp Asp Ser Thr Ala Thr Glu Ser Ala Ser Ser Ser Val Met Ser 

1125 1130 1135 

Pro Ser Phe Ser Ser Glu 
1140 



<210> 3 

<211> 7 

<212> PRT 

<213> Homo sapiens 

<220> 

<400> 3 



Pro Gin Ser Pro Leu Gin lie 
1 5 



<210> 4 

<211> 4159 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 4 

GGTGGATGCG TTTGGGTTGT AGCTAGGCTT TTTCTTTTCT TTCTCTTTTA AAACACATCT 60 

AGACAAGGAA AAAACAAGCC TCGGATCTGA TTTTTCACTC CTCGTTCTTG TGCTTGGTTC 120 

TTACTGTGTT TGTGTATTTT AAAGGCGAGA AGACGAGGGG AACAAAACCA GCTGGATCCA 180 

TCCATCACCG TGGGTGGTTT TAATTTTTCG TTTTTTCTCG TTATTTTTTT TTAAACAACC 240 

ACTCTTCACA ATGAACAAAC TGTATATCGG AAACCTCAGC GAGAACGCCG CCCCCTCGGA 300 

CCTAGAAAGT ATCTTCAAGG ACGCCAAGAT CCCGGTGTCG GGACCCTTCC TGGTGAAGAC 360 

TGGCTACGCG TTCGTGGACT GCCCGGACGA GAGCTGGGCC CTCAAGGCCA TCGAGGCGCT 4 20 

TTCAGGTAAA ATAGAACTGC ACGGGAAACC CATAGAAGTT GAGCACTCGG TCCCAAAAAG 4 80 

GCAAAGGATT CGGAAACTTC AG AT AC G AAA TATCCCGCCT CATTTACAGT GGGAGGTGCT 540 

GGATAGTTTA CTAGTCCAGT ATGGAGTGGT GGAGAGCTGT GAGCAAGTGA AC ACT G ACT C 600 

GGAAACTGCA GTTGTAAATG TAACCTATTC CAGTAAGGAC CAAGCTAGAC AAGCACTAGA 660 

CAAACTGAAT GGATTTCAGT TAGAGAATTT CACCTTGAAA GTAGCCTATA TCCCTGATGA 720 

AATGGCCGCC CAGCAAAACC CCTTGCAGCA GCCCCGAGGT CGCCGGGGGC TTGGGCAGAG 780 

GGGCTCCTCA AGGCAGGGGT CTCCAGGATC CGTATCCAAG CAGAAACCAT GTGATTTGCC 840 

TCTGCGCCTG CTGGTTCCCA CCCAATTTGT TGGAGCCATC ATAGGAAAAG AAGGTGCCAC 900 

CATTCGGAAC ATCACCAAAC AGACCCAGTC TAAAATCGAT GTCCACCGTA AAGAAAATGC 960 

GGGGGCTGCT GAGAAGTCGA TTACTATCCT CTCTACTCCT GAAGGCACCT CTGCGGCTTG 1020 

TAAGTCTATT CTGGAGATTA TGCATAAGGA AGCTCAAGAT ATAAAATTCA CAGAAGAGAT 1080 

CCCCTTGAAG ATTTTAGCTC ATAATAACTT TGTTGGACGT CTTATTGGTA AAGAAGGAAG 114 0 

AAATCTTAAA AAAATTGAGC AAGACACAGA CACTAAAATC ACGATATCTC CATTGCAGGA 1200 

ATTGACGCTG TATAATCCAG AACGCACTAT TACAGTTAAA GGCAATGTTG AGACATGTGC 1260 

CAAAGCTGAG GAGGAGATCA TGAAGAAAAT CAGGGAGTCT TATGAAAATG ATATTGCTTC 1320 

TATGAATCTT CAAGCACATT TAATTCCTGG ATTAAATCTG AACGCCTTGG GTCTGTTCCC 1380 



•ACCCACTTCA GGGATGCCAC CTCCCACCTC AGGGCCCCCT TCAGCCATGA CTCCTCCCTA 1440 
CCCqCAGTTT GAGCAATCAG AAACGGAGAC TGTTCATCAG TTTATCCCAG CTCTATCAGT 1500 
CGGTGCCATC ATCGGCAAGC AGGGCCAGCA CATCAAGCAG CTTTCTCGCT TTGCTGGAGC 1560 
TTCAATTAAG ATTGCTCCAG CGGAAGCACC AGATGCTAAA GTGAGGATGG TGATTATCAC 1620 
TGGACCACCA GAGGCTCAGT TCAAGGCTCA GGGAAGAATT TATGGAAAAA TTAAAGAAGA 1680 
AAACTTTGTT AGTCCTAAAG AAGAGGTGAA ACTTGAAGCT CATATCAGAG TGCCATCCTT 1740 
TGCTGCTGGC AGAGTTATTG GAAAAGGAGG CAAAACGGTG AATGAACTTC AGAATTTGTC 1800 
AAGTGCAGAA GTTGTTGTCC CTCGTGACCA GACACCTGAT GAGAATGACC AAGTGGTTGT 1860 
CAAAATAACT GGTCACTTCT ATGCTTGCCA GGTTGCCCAG AGAAAAATTC AGGAAATTCT 1920 
GACTCAGGTA AAGCAGCACC AACAACAGAA GGCTCTGCAA AGTGGACCAC CTCAGTCAAG 1980 
ACGGAAGTAA AGGCTCAGGA AACAGCCCAC CACAGAGGCA GATGCCAAAC CAAAGACAGA 2040 
TTGCTTAACC AACAGATGGG CGCTGACCCC CTATCCAGAA TCACATGCAC AAGTTTTTAC 2100 
CTAGCCAGTT GTTTCTGAGG ACCAGGCAAC TTTTGAACTC CTGTCTCTGT GAGAATGTAT 2160 
ACTTTATGCT CTCTGAAATG TATGACACCC AGCTTTAAAA CAAACAAACA AACAAACAAA 2220 
AAAAGGGTGG GGGAGGGAGG GAAAGAGAAG AGCTCTGCAC TTCCCTTTGT TGTAGTCTCA 2280 
CAGTATAACA GATATTCTAA TTCTTCTTAA TATTCCCCCA TAATGCCAGA AATTGGCTTA 2340 
ATGATGCTTT CACTAAATTC ATCAAATAGA TTGCTCCTAA ATCCAATTGT TAAAATTGGA 2400 
TCAGAATAAT TATCACAGGA ACTTAAATGT TAAGCCATTA GCATAGAAAA ACTGTTCTCA 24 60 
GTTTTATTTT TACCTAACAC TAACATGAGT AACCTAAGGG AAGTGCTGAA TGGTGTTGGC 2520 
AGGGGTATTA AACGTGCATT TTTACTCAAC TACCTCAGGT ATTCAGTAAT ACAATGAAAA 2580 
GCAAAATTGT TCCTTTTTTT TGAAAATTTT ATATACTTTA TAATGATAGA AGTCCAACCG 2640 
TTTTTTAAAA AATAAATTTA AAATTTAACA GCAATCAGCT AACAGGCAAA TTAAGATTTT 2700 
TACTTCTGGC TGGTGACAGT AAAGCTGGAA AATTAATTTC AGGGTTTTTT GAGGCTTTTG 2760 
ACACAGTTAT TAGTTAAATC AAATGTTCAA AAATACGGAG CAGTGCCTAG TATCTGGAGA 2820 
GCAGCACTAC CATTTATTCT TTCATTTATA GTTGGGAAAG TTTTTGACGG TACTAACAAA 2880 
GTGGTCGCAG GAGATTTTGG AACGGCTGGT TTAAATGGCT TCAGGAGACT TCAGTTTTTT 2940 
GTTTAGCTAC ATGATTGAAT GCATAATAAA TGCTTTGTGC TTCTGACTAT CAATACCTAA 3000 
AGAAAGTGCA TCAGTGAAGA GATGCAAGAC TTTCAACTGA CTGGCAAAAA GCAAGCTTTA 3060 
GCTTGTCTTA TAGGATGCTT AGTTTGCCAC TACACTTCAG ACCAATGGGA CAGTCATAGA 3120 
TGGTGTGACA GTGTTTAAAC GCAACAAAAG GCTACATTTC CATGGGGCCA GCACTGTCAT 3180 
GAGCCTCACT AAGCTATTTT GAAGATTTTT AAGCACTGAT AAATTAAAAA AAAAAAAAAA 3240 
AAATTAGACT CCACCTTAAG TAGTAAAGTA TAACAGGATT TCTGTATACT GTGCAATCAG 3300 
TTCTTTGAAA AAAAAGTCAA AAGATAGAGA ATACAAGAAA AGTTTTNGGG ATATAATTTG 3360 
AATGACTGTG AAAACATATG ACCTTTGATA ACGAACTCAT TTGCTCACTC CTTGACAGCA 3420 
AAGCCCAGTA CGTACAATTG TGTTGGGTGT GGGTGGTCTC CAAGGCCACG CTGCTCTCTG 3480 
AATTGATTTT TTGAGTTTTG GNTTGNAAGA TGATCACAGN CATGTTACAC TGATCTTNAA 354 0 
GGACATATNT TATAACCCTT TAAAAAAAAA ATCCCCTGCC TCATTCTTAT TTCGAGATGA 3600 
ATTTCGATAC AG AC T AG AT G TCTTTCTGAA GATCAATTAG ACATTNTGAA AATGATTTAA 3660 
AGTGTTTTCC TTAATGTTCT CTGAAAACAA GTTTCTTTTG TAGTTTTAAC CAAAAAAGTG 3720 
CCCTTTTTGT CACTGGTTTC TCCTAGCATT CATGATTTTT TTTTCACACA ATGAATTAAA 3780 
ATTGCTAAAA TCATGGACTG GCTTTCTGGT TGGATTTCAG GTAAGATGTG TTTAAGGCCA 384 0 
GAGCTTTTCT CAGTATTTGA TTTTTTTCCC CAATATTTGA TTTTTTAAAA ATATACACAT 3900 
AGGAGCTGCA TTTAAAACCT GCTGGTTTAA ATTCTGTCAN ATTTCACTTC TAGCCTTTTA 3960 
GTATGGCNAA TCANAATTTA CTTTTACTTA AGCATTTGTA ATTTGGAGTA TCTGGTACTA 4 020 
GCTAAGAAAT AATTCNATAA TTGAGTTTTG TACTCNCCAA ANATGGGTCA TTCCTCATGN 4080 
ATAATGTNCC CCCAATGCAG CTTCATTTTC CAGANACCTT GACGCAGGAT AAATTTTTTC 4140 
ATCATTTAGG TCCCCAAAA /icq 



<210> 5 
<211> 1708 
<212> DNA 

<213> Homo sapiens 

<220> 

<400> 5 

AGGGACGCTG CCGCACCGCC CCAGTTTACC CCGGGGAGCC ATCATGAAGC TGAATGGCCA 60 
CCAGTTGGAG AACCATGCCC TGAAGGTCTC CTACATCCCC GATGAGCAGA TAGCACAGGG 120 
ACCTGAGAAT GGGCGCCGAG GGGGCTTTGG CTCTCGGGGT CAGCCCCGCC AGGGCTCACC 180 
TGTGGCAGCG GGGGCCCCAG CCAAGCAGCA GCAAGTGGAC ATCCCCCTTC GGCTCCTGGT 240 



"GCCCACCCAG TATGTGGGTG CCATTATTGG CAAGGAGGGG GCCACCATCC GCAACATCAC 300 
AAAACAGACC CAGTCCAAGA TAGACGTGCA TAGGAAGGAG AACGCAGGTG CAGCTGAAAA 360 
AGCCATCAGT GTGCACTCCA CCCCTGAGGG CTGCTCCTCC GCTTGTAAGA TGATCTTGGA 420 
GATTATGCAT AAAGAGGCTA AGGACACCAA &ACGGCTGAC GAGGTTCCCC TGAAGATCCT 480 
GGCCCATAAT AACTTTGTAG GGCGTCTCAT TGGCAAGGAA GGACGGAACC TGAAGAAGGT 540 
AGAGCAAGAT ACCGAGACAA AAATCACCAT CTCCTCGTTG CAAGACCTTA CCCTTTACAA 600 
CCCTGAGAGG ACCATCACTG TGAAGGGGGC CATCGAGAAT TGTTGCAGGG CCGAGCAGGA 660 
AATAATGAAG AAAGTTCGGG AGGCCTATGA GAATGATGTG GCTGCCATGA GCTCTCACCT 720 
GATCCCTGGC CTGAACCTGG CTGCTGTAGG TCTTTTCCCA GCTTCATCCA GCGCAGTCCC 780 
GCCGCCTCCC AGCAGCGTTA CTGGGGCTGC TCCCTATAGC TCCTTTATGC AGGCTCCCGA 840 
GCAGGAGATG GTGCAGGTGT TTATCCCCGC CCAGGCAGTG GGCGCCATCA TCGGCAAGAA 900 
GGGGCAGCAC ATCAAACAGC TCTCCCGGTT TGCCAGCGCC TCCATCAAGA TTGCACCACC 960 
CGAAACACCT GACTCCAAAG TTCGTATGGT TATCATCACT GGACCGCCAG AGGCCCAATT 1020 
CAAGGCTCAG GGAAGAATCT ATGGCAAACT CAAGGAGGAG AACTTCTTTG GTCCCAAGGA 1080 
GGAAGTGAAG CTGGAGACCC ACATACGTGT GCCAGCATCA GCAGCTGGCC GGGTCATTGG 1140 
CAAAGGTGGA AAAACGGTGA ACGAGTTGCA GAATTTGACG GCAGCTGAGG TGGTAGTACC 1200 
AAGAGACCAG ACCCCTGATG AGAACGACCA GGTCATCGTG AAAATCATCG GACATTTCTA 1260 
TGCCAGTCAG ATGGCTCAAC GGAAGATCCG AGACATCCTG GCCCAGGTTA AGCAGCAGCA 1320 
TCAGAAGGGA CAGAGTAACC AGGCCCAGGC ACGGAGGAAG TGACCAGCCC CTCCCTGTCC 1380 
CTTNGAGTCC AGGACAACAA CGGGCAGAAA TCGAGAGTGT GCTCTCCCCG GCAGGCCTGA 14 4 0 
GAATGAGTGG GAATCCGGGA CACNTGGGCC GGGCTGTAGA TCAGGTTTGC CCACTTGATT 1500 
GAGAAAGATG TTCCAGTGAG GAACCCTGAT CTNTCAGCCC CAAACACCCA CCCAATTGGC 1560 
CCAACACTGT NTGCCCCTCG GGGTGTCAGA AATTNTAGCG CAAGGCACTT TTAAACGTGG 1620 
ATTGTTTAAA GAAGCTCTCC AGGCCCCACC AAGAGGGTGG ATCACACCTC AGTGGGAAGA 1680 
AAAATAAAAT TTCCTTCAGG TTTTAAAA 1708 



<210> 6 

<211> 3412 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 6 

GGCAGCGGAG GAGGCGAGGA GCGCCGGGTA CCGGGCCGGG GGAGCCGCGG GCTCTCGGGG 
AAGAGACGGA TGATGAACAA GCTTTACATC GGGAACCTGA GCCCCGCCGT CACCGCCGAC 
GACCTCCGGC AGCTCTTTGG GGACAGGAAG CTGCCCCTGG CGGGACAGGT CCTGCTGAAG 
TCCGGCTACG CCTTCGTGGA CTACCCCGAC CAGAACTGGG CCATCCGCGC CATCGAGACC 
CTCTCGGGTA AAGTGGAATT GCATGGGAAA ATCATGGAAG TTGATTACTC AGTCTCTAAA 
AAGCTAAGGA GCAGGAAAAT TCAGATTCGA AACATCCCTC CTCACCTGCA GTGGGAGGTG 
TTGGATGGAC TTTTGGCTCA AT AT GGG AC A GTGGAGAATG TGGAACAAGT CAACACAGAC 
ACAGAAACCG CCGTTGTCAA CGTCACATAT GCAACAAGAG AAGAAGCAAA AATAGCCATG 
GAGAAGCTAA GCGGGCATCA GTTTGAGAAC TACTCCTTCA AGATTTCCTA CATCCCGGAT 
GAAGAGGTGA GCTCCCCTTC GCCCCCTCAG CGAGCCCAGC GTGGGGACCA CTCTTCCCGG 
GAGCAAGGCC ACGCCCCTGG GGGCACTTCT CAGGCCAGAC AGATTGATTT CCCGCTGCGG 
ATCCTGGTCC CCACCCAGTT TGTTGGTGCC ATCATCGGAA AGGAGGGCTT GACCATAAAG 
AACATCACTA AGCAGACCCA GTCCCGGGTA GATATCCATA GAAAAGAGAA CTCTGGAGCT 
GCAGAGAAGC CTGTCACCAT CCATGCCACC CCAGAGGGGA CTTCTGAAGC ATGCCGCATG 
ATTCTTGAAA TCATGCAGAA AGAGGCAGAT GAGACCAAAC TAGCCGAAGA GATTCCTCTG 
AAAATCTTGG CACACAATGG CTTGGTTGGA AGACTGATTG GAAAAGAAGG CAGAAATTTG 
AAGAAAATTG AACATGAAAC AGGGACCAAG ATAACAATCT CATCTTTGCA GGATTTGAGC 
ATATACAACC CGGAAAGAAC CATCACTGTG AAGGGCACAG TTGAGGCCTG TGCCAGTGCT 
GAGATAGAGA TTATGAAGAA GCTGCGTGAG GCCTTTGAAA ATGATATGCT GGCTGTTAAC 
CAACAAGCCA ATCTGATCCC AGGGTTGAAC CTCAGCGCAC TTGGCATCTT TTCAACAGGA 
CTGTCCGTGC TATCTCCACC AGCAGGGCCC CGCGGAGCTC CCCCCGCTGC CCCCTACCAC 
CCCTTCACTA CCCACTCCGG ATACTTCTCC AGCCTGTACC CCCATCACCA GTTTGGCCCG 
TTCCCGCATC ATCACTCTTA TCCAGAGCAG GAGATTGTGA ATCTCTTCAT CCCAACCCAG 
GCTGTGGGCG CCATCATCGG GAAGAAGGGG GCACACATCA AACAGCTGGC GAGATTCGCC 
GGAGCCTCTA TCAAGATTGC CCCTGCGGAA GGCCCAGACG TCAGCGAAAG GAT GGT CATC 
ATCACCGGGC CACCGGAAGC CCAGTTCAAG GCCCAGGGAC GGATCTTTGG GAAACTGAAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



"GAGGAAAACT TCTTTAACCC CAAAGAAGAA GTGAAGCTGG AAGCGCATAT CAGAGTGCCC 1620 
TCTTCCACAG CTGGCCGGGT GATTGGCAAA GGTGGCAAGA CCGTGAACGA ACTGCAGAAC 1680 
TTAACCAGTG CAGAAGTCAT CGTGCCTCGT GACCAAACGC CAGATGAAAA TGAGGAAGTG 1740 
ATCGTCAGAA TTATCGGGCA CTTCTTTGCT AGCCAGACTG CACAGCGCAA GATCAGGGAA 1800 
ATTGTACAAC AGGTGAAGCA GCAGGAGCAG AAATACCCTC AGGGAGTCGC CTCACAGCGC 1860 
AGCAAGTGAG GCTCCCACAG GCACCAGCAA AACAACGGAT GAATGTAGCC CTTCCAACAC 1920 
CTGACAGAAT GAGACCAAAC GCAGCCAGCC AGATCGGGAG CAAACCAAAG ACCATCTGAG 1980 
GAATGAGAAG TCTGCGGAGG CGGCCAGGGA CTCTGCCGAG GCCCTGAGAA CCCCAGGGGC 2040 
CGAGGAGGGG CGGGGAAGGT CAGCCAGGTT TGCCAGAACC ACCGAGCCCC GCCTCCCGCC 2100 
CCCCAGGGCT TCTGCAGGCT TCAGCCATCC ACTTCACCAT CCACTCGGAT CTCTCCTGAA 2160 
CTCCCACGAC GCTATCCCTT TTAGTTGAAC TAACATAGGT GAACGTGTTC AAAGCCAAGC 2220 
AAAATGCACA CCCTTTTTCT GTGGCAAATC GTCTCTGTAC ATGTGTGTAC ATATTAGAAA 2280 
GGGAAGATGT TAAGATATGT GGCCTGTGGG TTACACAGGG TGCCTGCAGC GGTAATATAT 2340 
TTTAGAAATA AT AT AT C AAA TAACTCAACT AACTCCAATT TTTAATCAAT TATTAATTTT 2400 
TTTTTCTTTT TAAAGAGAAA GCAGGCTTTT CTAGACTTTA AAGAATAAAG TCTTTGGGAG 24 60 
GTCTCACGGT GTAGAGAGGA GCTTTGAGGC CACCCGCACA AAATTCACCC AGAGGGAAAT 2520 
CTCGTCGGAA GGACACTCAC GGCAGTTCTG GATCACCTGT GTATGTCAAC AGAAGGGATA 2580 
CCGTCTCCTT GAAGAGGAAA CTCTGTCACT CCTCATGCCT GTCTAGCTCA TACACCCATT 2640 
TCTCTTTGCT TCACAGGTTT TAAACTGGTT TTTTGCATAC TGCTATATAA TTCTCTGTCT 2700 
CTCTCTGTTT ATCTCTCCCC TCCCTCCCCT CCCCTTCTTC TCCATCTCCA TTCTTTTGAA 27 60 
TTTCCTCATC CCTCCATCTC AATCCCGTAT CTACGCACCC CCCCCCCCCC AGGCAAAGCA 2820 
GTGCTCTGAG TATCACATCA CACAAAAGGA ACAAAAGCGA AACACACAAA CCAGCCTCAA 2880 
CTTACACTTG GTTACTCAAA AGAACAAGAG TCAATGGTAC TTGTCCTAGC GTTTTGGAAG 2940 
AGGAAAACAG GAACCCACCA AACCAACCAA TCAACCAAAC AAAGAAAAAA TTCCACAATG 3000 
AAAGAATGTA TTTTGTCTTT TTGCATTTTG GTGTATAAGC CATCAATATT CAGCAAAATG 3060 
ATTCCTTTCT TTAAAAAAAA AAATGTGGAG GAAAGTAGAA ATTTACCAAG GTTGTTGGCC 3120 
CAGGGCGTTA AATTCACAGA TTTTTTTAAC GAGAAAAACA CACAGAAGAA GCTACCTCAG 3180 
GTGTTTTTAC CTCAGCACCT TGCTCTTGTG TTTCCCTTAG AGATTTTGTA AAGCTGATAG 3240 
TTGGAGCATT TTTTTATTTT TTTAATAAAA ATGAGTTGGA AAAAAAATAA GATATCAACT 3300 
GCCAGCCTGG AGAAGGTGAC AGTCCAAGTG TGCAACAGCT GTTCTGAATT GTCTTCCGCT 3360 
AGCCAAGAAC CNATATGGCC TTCTTTTGGA CAAACCTTGA AAATGTTTAT TT 3412 



<210> 7 
<211> 1946 
<212> DNA 

<213> Homo sapiens 

<220> 

<400> 7 

GCTGTAGCGG AGGGGCTGGG GGGCTGCTCT 
CCACCCAGAG GCCGGGGTGG GAGGGCGAGT 
TTCAAATCCG AAATATTCCA CCCCAGCTCC 
AGTATGGTAC AGTAGAGAAC TGTGAGCAAG 
ATGTCACCTA TTCCAACCGG GAGCAGACCA 
AGTTGGAGAA CCATGCCCTG AAGGTCTCCT 
CTGAGAATGG GCGCCGAGGG GGCTTTGGCT 
TGGCAGCGGG GGCCCCAGCC AAGCAGCAGC 
CCACCCAGTA TGTGGGTGCC ATTATTGGCA 
AACAGACCCA GTCCAAGATA GACGTGCATA 
CCATCAGTGT GCACTCCACC CCTGAGGGCT 
TTATGCATAA AGAGGCTAAG GACACCAAAA 
CCCATAATAA CTTTGTAGGG CGTCTCATTG 
AGCAAGATAC CGAGACAAAA ATCACCATCT 
CTGAGAGGAC CATCACTGTG AAGGGGGCCA 
TAATGAAGAA AGTTCGGGAG GCCTATGAGA 
TCCCTGGCCT GAACCTGGCT GCTGTAGGTC 
CGCCTCCCAG CAGCGTTACT GGGGCTGCTC 
AGGAGATGGT GCAGGTGTTT ATCCCCGCCC 
GGCAGCACAT CAAACAGCTC TCCCGGTTTG 



GTCCCCTTCC TTGCGCGCTG CGGCCTCAGC 60 
GCTCAGCTTC CCGGGTTAGG AGCCGGAAAA 120 
GATGGGAAGT ACTGGACAGC CTGCTGGCTC 180 
TGAACACCGA GAGTGAGACG GCAGTGGTGA 240 
GGCAAGCCAT CATGAAGCTG AATGGCCACC 300 
ACATCCCCGA TGAGCAGATA GCACAGGGAC 360 
CTCGGGGTCA GCCCCGCCAG GGCTCACCTG 420 
AAGTGGACAT CCCCCTTCGG CTCCTGGTGC 4 80 
AGGAGGGGGC CACCATCCGC AACATCACAA 540 
GGAAGGAGAA CGCAGGTGCA GCTGAAAAAG 600 
GCTCCTCCGC TTGTAAGATG ATCTTGGAGA 660 
CGGCTGACGA GGTTCCCCTG AAGATCCTGG 720 
GCAAGGAAGG ACGGAACCTG AAGAAGGTAG 780 
CCTCGTTGCA AGACCTTACC CTTTACAACC 840 
TCGAGAATTG TTGCAGGGCC GAGCAGGAAA 900 
ATGATGTGGC TGCCATGAGC TCTCACCTGA 960 
TTTTCCCAGC TTCATCCAGC GCAGTCCCGC 1020 
CCTATAGCTC CTTTATGCAG GCTCCCGAGC 1080 
AGGCAGTGGG CGCCATCATC GGCAAGAAGG 1140 
CCAGCGCCTC CATCAAGATT GCACCACCCG 1200 



* AAACACCTGA CTCCAAAGTT CGTATGGTTA TCATCACTGG ACCGCCAGAG GCCCAATTCA 1260 
AGGQTCAGGG AAGAATCTAT GGCAAACTCA AGGAGGAGAA CTTCTTTGGT CCCAAGGAGG 1320 
AAGTGAAGCT GGAGACCCAC ATACGTGTGC CAGCATCAGC AGCTGGCCGG GTCATTGGCA 1380 
AAGGTGGAAA AACGGTGAAC GAGTTGCAGA ATTTGACGGC AGCTGAGGTG GTAGTACCAA 14 4 0 
GAGACCAGAC CCCTGATGAG AACGACCAGG TCATCGTGAA AATCATCGGA CATTTCTATG 1500 
CCAGTCAGAT GGCTCAACGG AAGATCCGAG ACATCCTGGC CCAGGTTAAG CAGCAGCATC 1560 
AGAAGGGACA GAGTAACCAG GCCCAGGCAC GGAGGAAGTG ACCAGCCCCT CCCTGTCCCT 1620 
TNGAGTCCAG GACAACAACG GGCAGAAATC GAGAGTGTGC TCTCCCCGGC AGGCCTGAGA 1680 
ATGAGTGGGA ATCCGGGACA CNTGGGCCGG GCTGTAGATC AGGTTTGCCC ACTTGATTGA 1740 
GAAAGATGTT CCAGTGAGGA ACCCTGATCT NTCAGCCCCA AACACCCACC CAATTGGCCC 1800 
AACACTGTNT GCCCCTCGGG GTGTCAGAAA TTNTAGCGCA AGGCACTTTT AAACGTGGAT 1860 
TGTTTAAAGA AGCTCTCCAG GCCCCACCAA GAGGGTGGAT CACACCTCAG TGGGAAGAAA 1920 
AATAAAATTT CCTTCAGGTT TTAAAA 194 6 



<210> 8 
<211> 3283 
<212> DNA 

<213> Homo sapiens 

<220> 

<400> 8 

GGCAGCGGAG GAGGCGAGGA GCGCCGGGTA CCGGGCCGGG GGAGCCGCGG GCTCTCGGGG 60 
AAGAGACGGA TGATGAACAA GCTTTACATC GGGAACCTGA GCCCCGCCGT CACCGCCGAC 120 
GACCTCCGGC AGCTCTTTGG GGACAGGAAG CTGCCCCTGG CGGGACAGGT CCTGCTGAAG 180 
TCCGGCTACG CCTTCGTGGA CTACCCCGAC CAGAACTGGG CCATCCGCGC CATCGAGACC 240 
CTCTCGGGTA AAGTGGAATT GCATGGGAAA ATCATGGAAG TTGATTACTC AGTCTCTAAA 300 
AAGCTAAGGA GCAGGAAAAT TCAGATTCGA AACATCCCTC CTCACCTGCA GTGGGAGGTG 360 
TTGGATGGAC TTTTGGCTCA AT AT GGG AC A GTGGAGAATG TGGAACAAGT CAACACAGAC 420 
ACAGAAACCG CCGTTGTCAA CGTCACATAT GCAACAAGAG AAGAAGCAAA AATAGCCATG 480 
GAGAAGCTAA GCGGGCATCA GTTTGAGAAC TACTCCTTCA AGATTTCCTA CATCCCGGAT 540 
GAAGAGGTGA GCTCCCCTTC GCCCCCTCAG CGAGCCCAGC GTGGGGACCA CTCTTCCCGG 600 
GAGCAAGGCC ACGCCCCTGG GGGCACTTCT CAGGCCAGAC AG ATT GAT TT CCCGCTGCGG 660 
ATCCTGGTCC CCACCCAGTT TGTTGGTGCC ATCATCGGAA AGGAGGGCTT GACCATAAAG 720 
AACATCACTA AGCAGACCCA GTCCCGGGTA GATATCCATA GAAAAGAGAA CTCTGGAGCT 780 
GCAGAGAAGC CTGTCACCAT CCATGCCACC CC AG AG GGG A CTTCTGAAGC ATGCCGCATG 840 
ATTCTTGAAA TCATGCAGAA AGAGGCAGAT GAGACCAAAC TAGCCGAAGA GATTCCTCTG 900 
AAAATCTTGG CACACAATGG CTTGGTTGGA AGACTGATTG GAAAAGAAGG CAGAAATTTG 960 
AAGAAAATTG AACATGAAAC AG GG ACCAAG ATAACAATCT CATCTTTGCA GGATTTGAGC 1020 
ATATACAACC CGGAAAGAAC CATCACTGTG AAGGGCACAG TTGAGGCCTG TGCCAGTGCT 1080 
GAGATAGAGA TTATGAAGAA GCTGCGTGAG GCCTTTGAAA ATGATATGCT GGCTGTTAAC 1140 
ACCCACTCCG GATACTTCTC CAGCCTGTAC CCCCATCACC AGTTTGGCCC GTTCCCGCAT 1200 
CATCACTCTT ATCCAGAGCA GGAGATTGTG AATCTCTTCA TCCCAACCCA GGCTGTGGGC 1260 
GCCATCATCG GGAAGAAGGG GGCACACATC AAACAGCTGG CGAGATTCGC CGGAGCCTCT 1320 
ATCAAGATTG CCCCTGCGGA AGGCCCAGAC GTCAGCGAAA GGATGGTCAT CATCACCGGG 1380 
CCACCGGAAG CCCAGTTCAA GGCCCAGGGA CGGATCTTTG GGAAACTGAA AGAGGAAAAC 1440 
TTCTTTAACC CCAAAGAAGA AGTGAAGCTG GAAGCGCATA TCAGAGTGCC CTCTTCCACA 1500 
GCTGGCCGGG TGATTGGCAA AGGTGGCAAG ACCGTGAACG AACTGCAGAA CTTAACCAGT 1560 
GCAGAAGTCA TCGTGCCTCG TGACCAAACG CCAGATGAAA ATGAGGAAGT GATCGTCAGA 1620 
ATTATCGGGC ACTTCTTTGC TAGCCAGACT GCACAGCGCA AGATCAGGGA AATTGTACAA 1680 
CAGGTGAAGC AGCAGGAGCA GAAATACCCT CAGGGAGTCG CCTCACAGCG CAGCAAGTGA 1740 
GGCTCCCACA GGCACCAGCA AAACAACGGA TGAATGTAGC CCTTCCAACA CCTGACAGAA 1800 
TGAGACCAAA CGCAGCCAGC CAGATCGGGA GCAAACCAAA GACCATCTGA GGAATGAGAA 18 60 
GTCTGCGGAG GCGGCCAGGG ACTCTGCCGA GGCCCTGAGA ACCCCAGGGG CCGAGGAGGG 1920 
GCGGGGAAGG TCAGCCAGGT TTGCCAGAAC CACCGAGCCC CGCCTCCCGC CCCCCAGGGC 1980 
TTCTGCAGGC TTCAGCCATC CACTTCACCA TCCACTCGGA TCTCTCCTGA ACTCCCACGA 204 0 
CGCTATCCCT TTTAGTTGAA CTAACATAGG TGAACGTGTT CAAAGCCAAG CAAAATGCAC 2100 
ACCCTTTTTC TGTGGCAAAT CGTCTCTGTA CATGTGTGTA CATATTAGAA AGGGAAGATG 2160 
TTAAGATATG TGGCCTGTGG GTTACACAGG GTGCCTGCAG CGGTAATATA TTTTAGAAAT 2220 
AATATATCAA ATAACTCAAC TAACTCCAAT TTTTAATCAA TTATTAATTT TTTTTTCTTT 2280 



' TTAAAGAGAA AGCAGGCTTT TCTAGACTTT 
TGTAGAGAGG AGCTTTGAGG CCACCCGCAC 
AGGACACTCA CGGCAGTTCT GGATCACCTG 
TGAAGAGGAA ACTCTGTCAC TCCTCATGCC 
TTCACAGGTT TTAAACTGGT TTTTTGCATA 
TATCTCTCCC CTCCCTCCCC TCCCCTTCTT 
CCCTCCATCT CAATCCCGTA TCTACGCACC 
GTATCACATC ACACAAAAGG AACAAAAGCG 
GGTTACTCAA AAGAACAAGA GTCAATGGTA 
GGAACCCACC AAACCAACCA ATCAACCAAA 
ATTTTGTCTT TTTGCATTTT GGTGTATAAG 
TTTAAAAAAA AAAATGTGGA GGAAAGTAGA 
AAATTCACAG ATTTTTTTAA CGAGAAAAAC 
CCTCAGCACC TTGCTCTTGT GTTTCCCTTA 
TTTTTTATTT TTTTAATAAA AATGAGTTGG 
GAGAAGGTGA CAGTCCAAGT GTGCAACAGC 
CCNATATGGC CTTCTTTTGG ACAAACCTTG 



AAAGAATAAA GTCTTTGGGA GGTCTCACGG 2340 
AAAATTCACC CAGAGGGAAA TCTCGTCGGA 2400 
TGTATGTCAA CAGAAGGGAT ACCGTCTCCT 24 60 
TGTCTAGCTC ATACACCCAT TTCTCTTTGC 2520 
CTGCTATATA ATTCTCTGTC TCTCTCTGTT 2580 
CTCCATCTCC ATTCTTTTGA ATTTCCTCAT 2640 
CCCCCCCCCC CAGGCAAAGC AGTGCTCTGA 2700 
AAACACACAA ACCAGCCTCA ACTTACACTT 27 60 
CTTGTCCTAG CGTTTTGGAA GAGGAAAACA 2820 
CAAAGAAAAA ATTCCACAAT GAAAGAATGT 2880 
CCATCAATAT TCAGCAAAAT GATTCCTTTC 2940 
AATTTACCAA GGTTGTTGGC CCAGGGCGTT 3000 
ACACAGAAGA AGCTACCTCA GGTGTTTTTA 3060 
GAGATTTTGT AAAGCTGATA GTTGGAGCAT 3120 
AAAAAAAATA AGATATCAAC TGCCAGCCTG 3180 
TGTTCTGAAT TGTCTTCCGC TAGCCAAGAA 324 0 
AAAATGTTTA TTT 3283 



<210> 9 
<211> 21 
<212> DNA 

<213> Homo sapiens 

<220> 

<400> 9 

gaagtatct tcaaggacgc c 21 



<210> 10 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 10 

ctgcaagggg ttttgctggg eg 22 



<210> 11 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 11 

tccttgcgcg ctgcggcctc ag 

<210> 12 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 12 

ccaactggtg gccattcagc ttc 



22 



23 



<210> 13 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 13 

gctctttggg gacaggaagg tc 22 



<210> 14 
<211> 22 



•<212> DNA 
<213> Homo sapiens 
<220> 
<400> 14 

gacgttgaca acggcggttt ct 

<210> 15 

<211> 12 

<212> DNA 

<213> artificial sequence 
<220> 

<221> adaptor 

<222> 1. . .12 

<400> 15 
aatttgcggt ga 

<210> 16 
<211> 12 
<212> DNA 

<213> artificial sequence 
<220> 

<221> adaptor 
222> 1. . .12 
'00> 16 
tttgttca tg 

BMo> 17 
1> 12 
v^±2> DNA 

<213> artificial sequence 
<220> 

<221> adaptor 
<222> 1. . .12 
<400> 17 

aattttccct eg 

<210> 18 
<211> 24 
<212> DNA 

<213> artificial sequence 
<220> 

<221> adaptor 
<222> 1. . .24 
<400> 18 

agcactctcc agcctctcac catg 

<210> 19 
<211> 24 
<212> DNA 

<213> artificial sequence 
<220> 

<221> adaptor 
<222> 1. . .24 
<400> 19 

accgacgtcg actatcatg catg 

<210> 20 
<211> 24 



*<212> DNA 

<213> artificial sequence 
<220> 

<221> adaptor 
<222> 1. . .24 
<400> 20 

aggcaactgt gctatccgag catg 

<210> 21 
<211> 8 
<212> DNA 

<213> artificial sequence 
<220> 

<221> adaptor 
<222> 1. . .8 
<400> 21 

gtgagagg 

<210> 22 

c211> 8 

212> DNA 

**13> artificial sequence 
20> 

^ : ;,^'1> adaptor 

W$m> 1...8 

■:^0> 22 

catggatg 

<210> 23 - 
<211> 8 
<212> DNA 

<213> artificial sequence 
<220> 

<221> adaptor 
<222> 1. . .8 
<400> 23 



ctcggata 



